science and engineering
Balancing Efficiency and Fairness: An Iterative Exchange Framework for Multi-UAV Cooperative Path Planning
Li, Hongzong, Liao, Luwei, Dai, Xiangguang, Feng, Yuming, Feng, Rong, Tang, Shiqin
Multi-UAV cooperative path planning (MUCPP) is a fundamental problem in multi-agent systems, aiming to generate collision-free trajectories for a team of unmanned aerial vehicles (UAVs) to complete distributed tasks efficiently. A key challenge lies in achieving both efficiency, by minimizing total mission cost, and fairness, by balancing the workload among UAVs to avoid overburdening individual agents. This paper presents a novel Iterative Exchange Framework for MUCPP, balancing efficiency and fairness through iterative task exchanges and path refinements. The proposed framework formulates a composite objective that combines the total mission distance and the makespan, and iteratively improves the solution via local exchanges under feasibility and safety constraints. For each UAV, collision-free trajectories are generated using A* search over a terrain-aware configuration space. Comprehensive experiments on multiple terrain datasets demonstrate that the proposed method consistently achieves superior trade-offs between total distance and makespan compared to existing baselines.
- Energy (0.68)
- Transportation (0.47)
- Aerospace & Defense > Aircraft (0.34)
Accelerating scientific discovery with the common task framework
Kutz, J. Nathan, Battaglia, Peter, Brenner, Michael, Carlberg, Kevin, Hagberg, Aric, Ho, Shirley, Hoyer, Stephan, Lange, Henning, Lipson, Hod, Mahoney, Michael W., Noe, Frank, Welling, Max, Zanna, Laure, Zhu, Francis, Brunton, Steven L.
Machine learning (ML) and artificial intelligence (AI) algorithms are transforming and empowering the characterization and control of dynamic systems in the engineering, physical, and biological sciences. These emerging modeling paradigms require comparative metrics to evaluate a diverse set of scientific objectives, including forecasting, state reconstruction, generalization, and control, while also considering limited data scenarios and noisy measurements. We introduce a common task framework (CTF) for science and engineering, which features a growing collection of challenge data sets with a diverse set of practical and common objectives. The CTF is a critically enabling technology that has contributed to the rapid advance of ML/AI algorithms in traditional applications such as speech recognition, language processing, and computer vision. There is a critical need for the objective metrics of a CTF to compare the diverse algorithms being rapidly developed and deployed in practice today across science and engineering.
- North America > United States > Washington > King County > Seattle (0.14)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- (9 more...)
- Information Technology (0.68)
- Leisure & Entertainment > Games (0.46)
- Government > Regional Government > North America Government > United States Government (0.46)
- Energy (0.46)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Robots (0.93)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.64)
Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems
Gao, Jiaxin, Chen, Chen, Jia, Yanwen, Gong, Xueluan, Lam, Kwok-Yan, Wang, Qian
Abstract--Large Language Models (LLMs) are increasingly being used to autonomously evaluate the quality of content in communication systems, e.g., to assess responses in telecom customer support chatbots. However, the impartiality of these AI "judges" is not guaranteed, and any biases in their evaluation criteria could skew outcomes and undermine user trust. In this paper, we systematically investigate judgment biases in two LLMas-a-judge models (i.e., GPT -Judge and JudgeLM) under the point-wise scoring setting, encompassing 11 types of biases that cover both implicit and explicit forms. We observed that state-of-the-art LLM judges demonstrate robustness to biased inputs, generally assigning them lower scores than the corresponding clean samples. Providing a detailed scoring rubric further enhances this robustness. We further found that fine-tuning an LLM on high-scoring yet biased responses can significantly degrade its performance, highlighting the risk of training on biased data. We also discovered that the judged scores correlate with task difficulty: a challenging dataset like GPQA yields lower average scores, whereas an open-ended reasoning dataset (e.g., JudgeLM-val) sees higher average scores. Finally, we proposed four potential mitigation strategies to ensure fair and reliable AI judging in practical communication scenarios. Large Language Models (LLMs) are increasingly employed as automated judges to evaluate the quality of AI-generated responses in place of human annotators [1]. This LLM-as-a-Judge paradigm offers scalable and flexible evaluation, providing natural-language feedback and rapid scoring across diverse tasks [2]. The communications and networking industry has begun exploring such AI evaluators in domain-specific applications.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- Asia > Singapore (0.05)
- Asia > China > Hubei Province > Wuhan (0.05)
- (3 more...)
A Nonlinear Low-rank Representation Model with Convolutional Neural Network for Imputing Water Quality Data
Liao, Xin, Yang, Bing, Yu, Cai
The integrity of Water Quality Data (WQD) is critical in environmental monitoring for scientific decision-making and ecological protection. However, water quality monitoring systems are often challenged by large amounts of missing data due to unavoidable problems such as sensor failures and communication delays, which further lead to water quality data becoming High-Dimensional and Sparse (HDS). Traditional data imputation methods are difficult to depict the potential dynamics and fail to capture the deep data features, resulting in unsatisfactory imputation performance. To effectively address the above issues, this paper proposes a Nonlinear Low-rank Representation model (NLR) with Convolutional Neural Networks (CNN) for imputing missing WQD, which utilizes CNNs to implement two ideas: a) fusing temporal features to model the temporal dependence of data between time slots, and b) Extracting nonlinear interactions and local patterns to mine higher-order relationships features and achieve deep fusion of multidimensional information. Experimental studies on three real water quality datasets demonstrate that the proposed model significantly outperforms existing state-of-the-art data imputation models in terms of estimation accuracy. It provides an effective approach for handling water quality monitoring data in complex dynamic environments.
- Research Report > New Finding (0.34)
- Research Report > Experimental Study (0.34)
Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy
Pratiush, Utkarsh, Houston, Austin, Barakati, Kamyar, Raghavan, Aditya, Yoon, Dasol, KP, Harikrishnan, Baraissov, Zhaslan, Ma, Desheng, Welborn, Samuel S., Jakowski, Mikolaj, Barhorst, Shawn-Patrick, Pattison, Alexander J., Manganaris, Panayotis, Madugula, Sita Sirisha, Ayyagari, Sai Venkata Gayathri, Kennedy, Vishal, Bulanadi, Ralph, Wang, Michelle, Pang, Kieran J., Addison-Smith, Ian, Menacho, Willy, Guzman, Horacio V., Kiefer, Alexander, Furth, Nicholas, Kolev, Nikola L., Petrov, Mikhail, Liu, Viktoriia, Ilyev, Sergey, Rairao, Srikar, Rodani, Tommaso, Pinto-Huguet, Ivan, Chen, Xuli, Cruañes, Josep, Torrens, Marta, Pomar, Jovan, Su, Fanzhi, Vedanti, Pawan, Lyu, Zhiheng, Wang, Xingzhi, Yao, Lehan, Taqieddin, Amir, Laskowski, Forrest, Yin, Xiangyu, Shao, Yu-Tsun, Fein-Ashley, Benjamin, Jiang, Yi, Kumar, Vineet, Mishra, Himanshu, Paul, Yogesh, Bazgir, Adib, Madugula, Rama chandra Praneeth, Zhang, Yuwen, Omprakash, Pravan, Huang, Jian, Montufar-Morales, Eric, Chawla, Vivek, Sethi, Harshit, Huang, Jie, Kurki, Lauri, Guinan, Grace, Salvador, Addison, Ter-Petrosyan, Arman, Van Winkle, Madeline, Spurgeon, Steven R., Narasimha, Ganesh, Wu, Zijie, Liu, Richard, Liu, Yongtao, Slautin, Boris, Lupini, Andrew R, Vasudevan, Rama, Duscher, Gerd, Kalinin, Sergei V.
Microscopy is a primary source of information on materials structure and functionality at nanometer and atomic scales. The data generated is often well-structured, enriched with metadata and sample histories, though not always consistent in detail or format. The adoption of Data Management Plans (DMPs) by major funding agencies promotes preservation and access. However, deriving insights remains difficult due to the lack of standardized code ecosystems, benchmarks, and integration strategies. As a result, data usage is inefficient and analysis time is extensive. In addition to post-acquisition analysis, new APIs from major microscope manufacturers enable real-time, ML-based analytics for automated decision-making and ML-agent-controlled microscope operation. Yet, a gap remains between the ML and microscopy communities, limiting the impact of these methods on physics, materials discovery, and optimization. Hackathons help bridge this divide by fostering collaboration between ML researchers and microscopy experts. They encourage the development of novel solutions that apply ML to microscopy, while preparing a future workforce for instrumentation, materials science, and applied ML. This hackathon produced benchmark datasets and digital twins of microscopes to support community growth and standardized workflows. All related code is available at GitHub: https://github.com/KalininGroup/Mic-hackathon-2024-codes-publication/tree/1.0.0.1
- North America > United States > California > Los Angeles County > Los Angeles (0.28)
- North America > United States > Tennessee > Knox County > Knoxville (0.14)
- North America > United States > California > Alameda County > Berkeley (0.14)
- (33 more...)
- Research Report > New Finding (1.00)
- Instructional Material > Course Syllabus & Notes (1.00)
- Contests & Prizes (1.00)
- Health & Medicine > Therapeutic Area (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
- Education (1.00)
- (3 more...)
TritonZ: A Remotely Operated Underwater Rover with Manipulator Arm for Exploration and Rescue Operations
Ahmed, Kawser, Fardin, Mir Shahriar, Nayem, Md Arif Faysal, Hafiz, Fahim, Shatabda, Swakkhar
The increasing demand for underwater exploration and rescue operations enforces the development of advanced wireless or semi-wireless underwater vessels equipped with manipulator arms. This paper presents the implementation of a semi-wireless underwater vehicle, "TritonZ" equipped with a manipulator arm, tailored for effective underwater exploration and rescue operations. The vehicle's compact design enables deployment in different submarine surroundings, addressing the need for wireless systems capable of navigating challenging underwater terrains. The manipulator arm can interact with the environment, allowing the robot to perform sophisticated tasks during exploration and rescue missions in emergency situations. TritonZ is equipped with various sensors such as Pi-Camera, Humidity, and Temperature sensors to send real-time environmental data. Our underwater vehicle controlled using a customized remote controller can navigate efficiently in the water where Pi-Camera enables live streaming of the surroundings. Motion control and video capture are performed simultaneously using this camera. The manipulator arm is designed to perform various tasks, similar to grasping, manipulating, and collecting underwater objects. Experimental results shows the efficacy of the proposed remotely operated vehicle in performing a variety of underwater exploration and rescue tasks. Additionally, the results show that TritonZ can maintain an average of 13.5cm/s with a minimal delay of 2-3 seconds. Furthermore, the vehicle can sustain waves underwater by maintaining its position as well as average velocity. The full project details and source code can be accessed at this link: https://github.com/kawser-ahmed-byte/TritonZ
- Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.04)
- Asia > Malaysia (0.04)
- Asia > India > Tamil Nadu > Chennai (0.04)
Superstudent intelligence in thermodynamics
Loubet, Rebecca, Zittlau, Pascal, Hoffmann, Marco, Vollmer, Luisa, Fellenz, Sophie, Leitte, Heike, Jirasek, Fabian, Lenhard, Johannes, Hasse, Hans
In this short note, we report and analyze a striking event: OpenAI's large language model o3 has outwitted all students in a university exam on thermodynamics. The thermodynamics exam is a difficult hurdle for most students, where they must show that they have mastered the fundamentals of this important topic. Consequently, the failure rates are very high, A-grades are rare - and they are considered proof of the students' exceptional intellectual abilities. This is because pattern learning does not help in the exam. The problems can only be solved by knowledgeably and creatively combining principles of thermodynamics. We have given our latest thermodynamics exam not only to the students but also to OpenAI's most powerful reasoning model, o3, and have assessed the answers of o3 exactly the same way as those of the students. In zero-shot mode, the model o3 solved all problems correctly, better than all students who took the exam; its overall score was in the range of the best scores we have seen in more than 10,000 similar exams since 1985. This is a turning point: machines now excel in complex tasks, usually taken as proof of human intellectual capabilities. We discuss the consequences this has for the work of engineers and the education of future engineers.
- Europe > Germany > Rhineland-Palatinate > Kaiserslautern (0.07)
- North America > United States > New York > New York County > New York City (0.04)
- Africa > Ghana (0.04)
- Education > Educational Setting > Higher Education (0.48)
- Education > Curriculum > Subject-Specific Education (0.47)
- Education > Educational Technology > Educational Software (0.46)
FedMABA: Towards Fair Federated Learning through Multi-Armed Bandits Allocation
Wang, Zhichao, Wang, Lin, Guo, Yongxin, Zhang, Ying-Jun Angela, Tang, Xiaoying
The increasing concern for data privacy has driven the rapid development of federated learning (FL), a privacy-preserving collaborative paradigm. However, the statistical heterogeneity among clients in FL results in inconsistent performance of the server model across various clients. Server model may show favoritism towards certain clients while performing poorly for others, heightening the challenge of fairness. In this paper, we reconsider the inconsistency in client performance distribution and introduce the concept of adversarial multi-armed bandit to optimize the proposed objective with explicit constraints on performance disparities. Practically, we propose a novel multi-armed bandit-based allocation FL algorithm (FedMABA) to mitigate performance unfairness among diverse clients with different data distributions. Extensive experiments, in different Non-I.I.D. scenarios, demonstrate the exceptional performance of FedMABA in enhancing fairness.
- Information Technology > Data Science > Data Mining > Big Data (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
An Adaptive Latent Factorization of Tensors Model for Embedding Dynamic Communication Network
Liao, Xin, Hu, Qicong, Tang, Peng
The Dynamic Communication Network (DCN) describes the interactions over time among various communication nodes, and it is widely used in Big-data applications as a data source. As the number of communication nodes increases and temporal slots accumulate, each node interacts in with only a few nodes in a given temporal slot, the DCN can be represented by an High-Dimensional Sparse (HDS) tensor. In order to extract rich behavioral patterns from an HDS tensor in DCN, this paper proposes an Adaptive Temporal-dependent Tensor low-rank representation (ATT) model. It adopts a three-fold approach: a) designing a temporal-dependent method to reconstruct temporal feature matrix, thereby precisely represent the data by capturing the temporal patterns; b) achieving hyper-parameters adaptation of the model via the Differential Evolutionary Algorithms (DEA) to avoid tedious hyper-parameters tuning; c) employing nonnegative learning schemes for the model parameters to effectively handle an the nonnegativity inherent in HDS data. The experimental results on four real-world DCNs demonstrate that the proposed ATT model significantly outperforms several state-of-the-art models in both prediction errors and convergence rounds.
- Information Technology > Data Science > Data Mining (1.00)
- Information Technology > Communications (1.00)
- Information Technology > Artificial Intelligence > Robots (1.00)
- (4 more...)
Distributed Multi-robot Online Sampling with Budget Constraints
Shamshirgaran, Azin, Manjanna, Sandeep, Carpin, Stefano
In multi-robot informative path planning the problem is to find a route for each robot in a team to visit a set of locations that can provide the most useful data to reconstruct an unknown scalar field. In the budgeted version, each robot is subject to a travel budget limiting the distance it can travel. Our interest in this problem is motivated by applications in precision agriculture, where robots are used to collect measurements to estimate domain-relevant scalar parameters such as soil moisture or nitrates concentrations. In this paper, we propose an online, distributed multi-robot sampling algorithm based on Monte Carlo Tree Search (MCTS) where each robot iteratively selects the next sampling location through communication with other robots and considering its remaining budget. We evaluate our proposed method for varying team sizes and in different environments, and we compare our solution with four different baseline methods. Our experiments show that our solution outperforms the baselines when the budget is tight by collecting measurements leading to smaller reconstruction errors.
- North America > United States > California > Merced County > Merced (0.14)
- Asia > India (0.04)